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Abstract —This paper studies an online algorithm for an energy 
harvesting transmitter, where the transmission (completion) time 
is considered as the system performance. Unlike the existing 
online algorithms which more or less require the knowledge on 
the future behavior of the energy-harvesting rate, we consider 
a practical but significantly more challenging scenario where 
the energy-harvesting rate is assumed to be totally unknown. 
Our design is formulated as a robust-optimal control problem 
which aims to optimize the worst-case performance. The transmit 
power is designed only based on the current battery energy level 
and the data queue length directly monitored by the transmitter 
itself. Specifically, we apply an event-trigger approach in which 
the transmitter continuously monitors the battery energy and 
triggers an event when a significant change occurs. Once an 
event is triggered, the transmit power is updated according to 
the solution to the robust-optimal control problem, which is 
given in a simple analytic form. We present numerical results 
on the transmission time achieved by the proposed design and 
demonstrate its robust-optimality. 

Index Terms —Energy harvesting, online algorithm, event trig¬ 
ger, robust-optimal control, transmission-time minimization. 


were studied. By modeling the energy arrival process as a com¬ 
pound Poisson process, ng proposed a throughput-optimal 
transmission policy with a deadline in the continuous-time 
domain by dynamic programming. In pT) , the energy arrival 
process was formulated as first-order stationary Markov model 
and the finite-horizon throughput-optimal transmission policy 
was derived. Aiming at minimizing the delay, provided 
a closed-form design for the transmission policy which has a 
multi-level water-filling structure. Without explicitly modeling 
the energy and data arrival processes, employed an upper 
bound on the long-term data loss ratio and a threshold on 
the frequency of visits to zero battery state to give a near 
throughput-optimal transmission policy. Although the majority 
of the designed transmission policies aim at either maximizing 
the throughput or minimizing the delay, other studies also 
considered maximizing the communication reliability G3 or 
minimizing the energy consumption 115|. 


I. Introduction 
A. Motivation and Related Work 

The development of energy harvesting devices has attracted 
significant attention in recent years with many potential ap¬ 
plications in communication networks for green and self- 
sustainable communications 0 - In order to make efficient 
use of harvested energy, both offline and online solutions 
have been investigated for designing the optimal transmission 
policy. Offline solutions are possible in highly predictable en¬ 
vironments where the energy and data arrivals in a sufficiently 
distant future (for communication purpose) can be accurately 
estimated @-0- On the other hand, online solutions typically 
reduce the dependence on the future knowledge of the energy 
and data arrival processes, and hence are more applicable in 
practice. The online algorithms can be roughly categorized 
into two frameworks as follows: 

In the first framework, the statistical parameters (e.g., the 
expectation) of energy and data arrival processes are known, 
and the designs of online transmission policies for energy har¬ 
vesting nodes are often stated as stochastic control problems. 
In ®, 0, the energy and data arrival processes were modeled 
as stationary and ergodic stochastic processes, where the 
throughput-optimal and delay-optimal transmission policies 


The second framework uses parameter-independent method¬ 
ologies. In II6)> the energy and data arrival processes were 
formulated as time-homogeneous Markov chains without 
knowing the transition matrix, and Q-learning was applied 
to perform online optimization on the transmission policy. 
By Lyapunov optimization technique combined with the 
idea of weight perturbation, d) proposed a generic utility- 
maximization policy, under the assumption that the amount 
of harvested energy in each time slot is independent and 
identically distributed (i.i.d.) but its statistical parameters are 
totally unknown. Although these parameter-independent poli¬ 
cies require less knowledge on the energy and data arrival 
processes, the stochastic models of the energy and data arrival 
processes still need to be known exactly. 

In practical scenario, the factors determining the energy ar¬ 
rivals are complex, dynamically changing and often unknown 
to the system designer. It is sometimes even difficult to come 
up with accurate models for the energy-harvesting rate. This 
leads to an interesting and practical design problem: how to 
design and implement an online transmission policy for energy 
harvesting nodes without imposing any assumption on the 
future behaviors of the energy-harvesting rate? In this paper, 
we aim to provide an answer to this important question. 
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taintiesj^ We use it to ensure that the system performance 
(i.e., the transmission time) is no worse than a level (the 
optimized worst-case performance), no matter what kind of 
energy arrival process is imposed. The other method is the 
event-trigger based control pO) (or aperiodic control). It can 
significantly reduce the unnecessary computations compared 
to the traditional periodic control (i.e., time-slotted control^ 
Our paper is mostly related to the recent work in 0- 
| [2^ , which considered the similar assumptions that the future 
energy arrival is unknown. In pl]-|[23), the competitive analy¬ 
sis p4) was employed to minimize the gap between online and 
offline performances. However, minimizing this gap cannot 
directly guarantee a certain system performance (e.g., optimal 
worst-case performance) of online algorithms. Additionally, 
these recent studies still considered time-slotted systems, and 
hence, the transmission protocol is updated in every time slot, 
regardless of the change in the amount of energy available. 


B. Our Contributions 

In this work, we study the performance of an energy¬ 
harvesting transmitter measured by the transmission (com¬ 
pletion) time, i.e., the time duration it takes to complete 
the transmission of a given amount of data. We propose to 
use event-trigger based design to control the transmit power 
without any knowledge on the future behavior of the energy¬ 
harvesting rate. In the considered scenario, it is not possible 
to use any statistics of the transmission time in the design. 
Hence, we adopt the robust-optimal control to minimize 
the worst-case transmission time. Nevertheless, the minimum 
worst-case transmission time may not always be finite. When 
the minimum worst-case transmission time is infinite, which 
happens when too much data is given to be transmitted with 
insufficient initial battery energy, we measure the robust- 
optimality of the transmit power design by looking at the set of 
energy-harvesting rates that result in finite transmission times. 
The robust-optimal design ensures the largest set of energy¬ 
harvesting rates resulting in finite transmission times. 

The proposed event-trigger based transmitter has two build¬ 
ing blocks for implementing the transmit power, namely an 
Event Detector (ED) and a Transmission Planner (TP). The 
ED continuously monitors the battery energy and triggers a 
new event when it experiences some significant change since 
the last event. Whenever an event is triggered, the TP uses 
the current knowledge of battery energy and data queue to 
update the transmit power by robust-optimal control. The 
updated transmit power is implemented until the next event 
is triggered. To the best of our knowledge, this is the first 

*The term “robust optimality” originates from the robust optimization (e.g., 
in |19| ). It refers to the optimization of an objective over an uncertain set of 
situations such that the objective is always not worse than the optimized level. 
If all the variables in one robust optimization problem are functions of time, 
then this problem becomes a robust-optimal control problem. 

^When designing communication protocol for energy harvesting transmit¬ 
ters, the transmission policy is designed according to the amount of energy and 
data available. The event-trigger based control updates the transmission policy 
only when there is a notable change in the amount of energy. In contrast, the 
traditional time-slotted control always performs computation to update the 
transmission policy at regular time intervals. 


time that the event-trigger based design is implemented on 
energy harvesting transmitters. 

To facilitate the robust-optimal design, we first give a 
comprehensive analysis on the behavior of battery energy and 
data queue in each triggered event. Specifically, we define 
the reachable set, which describes all possible states (battery 
energy and data queue) reachable in one event based on the 
TP’s knowledge, and reflects the relationship among battery 
energy, data queue and transmit power. Base on these analyses, 
we derive the solution of the robust-optimal transmit power 
design, given in a simple analytic form. 


C. Paper Organization and Notation 

In Section the system model is given and the event- 
trigger based transmission is introduced. In Section III the 
problem of finding the robust-optimal transmit power design is 
defined. We study the properties of the proposed event-trigger 
based system through the reachable set analysis in Section [TV 
The optimal solution to the problem is given in Section |V 
In Section |Vl] simulation results are shown to illustrate the 
effectiveness of our design and corroborate our theoretical 


results. Einally, conclusion is drawn in Section VII 

Throughout this paper, IR+, IR_|_, and denote the sets of 
non-negative real numbers, positive real numbers, and positive 
integers. X(t) denotes the time derivative of X{t) which is 
a function of time. /i(S') is the Lebesgue measure of set S. 
The restriction (Page 36 in p5| ) of function / to domain A 
is /U- For a; S M, [a;]+ returns max{a;,0}. 


H. System Model and Event-Trigger Approach 
A. System Model 

We consider a transmitter-receiver pair as shown in Eig. 

At time t G [to, c»), where to is the starting time of the 
communication, the transmitter has a battery with energy 
C(t) G M+. The harvested energy is stored in the battery with 
energy-harvesting rate N : [to, oo) —1R+. We assume H, as a 
function of time, is Lebesgue integrable over any subset of M+ 
with finite measure, and all such H form the set T-L. The trans¬ 
mit power at time instant t is p(t) G [0, Pmax], which is also 
Lebesgue integrable over any subset of K+ with finite measure, 
and Pmax denotes the maximum power constraint. Then, the 
relationship among battery energy, energy-harvesting rate and 
transmit power is given by a differential equation 

E{t)=H{t)-p{t), (1) 

where the initial battery energy is i?(fo) € In this pa¬ 
per, we consider the transmission-time minimization problem, 
where all the data to be transmitted is available at fo- The 
data queue is Q{t) G K+, and the transmission rate at t is 
r(t) G M+, which is Lebesgue integrable over any subset 
of ]R_|. with finite measure, and the relationship between data 
queue length and transmission rate is 

Q(t) = -r(t), (2) 

where the initial data queue is Q(to) G K+. Equation Q 
means all the amount of data (equal to Q(to)) to be trans¬ 
mitted is available at tg, and there is no subsequent data 





3 



Fig. 1. The event-trigger based energy harvesting transmitter. 

arrival in {to, oo), which is a commonly used assumption 
for transmission-time minimization problem (see 0 for an 
example). The other system assumptions are as follows: 

• The energy-harvesting rate H is totally unknown. 

• Battery energy E{t) can be measured at current time t. 

• The channel is assumed to be static such that the channel 
capacity is C{t) = log 2 (l +p{t)). 

Remark 1. Unlike most existing online algorithms which 
somewhat included the prior-knowledge on H, we aim to 
derive an online algorithm without any knowledge of H. Ad¬ 
ditionally, the battery energy can be monitored continuously^ 
We consider a simple static channel that is only affected by 
AWGN. Note that the static-channel assumption is widely used 
in the literature, e.g., & & 

Note that the transmit power p{t) and the transmission rate 
r{t) are to be designed by us, and in this paper, we assume 
the channel capacity is achieved, i.e., r(t) = C{f) = log 2 (l + 
p{t)), which means at each time t the transmission rate is a 
function of transmit power, labeled as r{p{t)). Now, the only 
variable to be designed is the mapping p : 1p{t)- 

B. Event-Trigger Based Control 

As shown in Fig. [T] the event-trigger control relies on an 
Event Detector (ED) that detects the necessary changes in the 
cumulated energy; and a Transmission Planner (TP) that gives 
the design for transmit power p{t). 

We give the condition under which an event is triggered. 

Definition 1 (Triggering Condition). From a given time instant 
s, an event is triggered at t (t > s) whenever the following 
inequalities is satisfied 

f H{T)dT E{t) — E{s)-\- f p(r)dT > £, (3) 

J S J S 

where e S M+ is the triggering threshold. 

In Definition [T] condition (|^ means that when the harvested 
energy cumulates over a certain level e, the ED triggers a new 
event. Even though H{t) is unknown, the integrals of E[{t) 
can be calculated by equality (a) in (|^, which is derived by 

^The technology for continuously monitoring the battery energy with 
minimal operating energy consumption has been developed over the past 
decades (e.g., |26| ). Hence, we assume that this operating energy can be 
neglected as compared to the energy consumption for transmission. 


the solutions of Q, where Eif) is observable. The transmit 
power p{t) for r S (s, f) is determined by the TP (to be 
discussed in later part of this subsection). 

Recall that the transmission is carried out over the entire 
communication time interval [to, oo). At the initial time 
instant to, the ED triggers the start of the transmission. Then, 
the ED will start monitoring the system on [fg, t], where t is 
the current time instant. We label the first time instant (after 
to) at which the system satisfies the triggering condition in 
Definition [T] by ti. After ti, the ED will start monitoring the 
system on {ti, t]. The next time instant at which an event is 
triggered is labeled as t2, and so on. Eor convenience, we say 
that event n starts at and finishes at tn+i. This completes 
our description of the ED. 

Whenever an event comes, the TP plans the transmit power 
to be implemented from the current time instant until the 
next event arrives, and we analyze event n without loss of 
generality. It is important to note that the TP only takes into 
account the information available at the beginning of the event 
when planning the transmit power. Such information includes 
the battery energy and data queue length at f„, i.e., E{tn) and 
Q{tn). However, any future change due to energy-harvesting 
rate, i.e., H{t) for t > tn, cannot be taken into account, 
simply because H{t) for t > tn is unknown to the TP at 
tn. Specifically, at the beginning of the n* event, i.e., at the 
time instant f„, the TP records the values of E{tn) and Q{tn), 
and designs p{t) to be implemented over (f„, f„+i] using 0 
and 0 with H{t) = 0. This does not mean the TP neglects 
the effect from H{t) all the time, because H{t) determines 
the arrival time of the next event, by the triggering condition. 

In each event n, the TP plans the transmit power for a finite 
time window after tn, and we call it as the Planned Transmit 
Power (PTP): 

Pn,€ ■ in T ^n] ^ [0: Pm ax] 7 (4) 

where the subscript £ means the PTP is designed under a given 
triggering threshold e, and the duration of the time window 
Tn is called as the planned transmission time. In (|^, the PTP 
is described as a mapping from a time instant to a value of 
transmit power. Eor any t G {tn, -|-T„], Pn,e{i) is the value 
of transmit power at time t. Thus, a PTP can be described 
by two parameters, i.e., Pn,e{i) and Tn. Eor example, the 
PTP pn,e{i) = |sinf|, t G {tn, tn + 1] is described by pair 
{Pn,e{i) ,Tn) = (| sinf|, 1). All possible pn,e compose the PTP 
set %)n,£- The design problem here is to find a good mapping 
t Pn,e{t) in 0. Note that T„ is part of the description 
of a PTP, and in other words, the design of ^ effectively 
includes the design of r„. We will see in Section |V] that T„ 
plays an important role in designing the optimal Pn,e- 

Ideally, the PTP should be implemented over {tn, tn -\-Tn]. 
However, the TP performs the design at based on the current 
information {E{tn) and Q{tn)), and hence, cannot predict the 
exact value of f„+i, i.e., it does not know when the next 
event occurs. As a result, the PTP in event n will not be 
implemented beyond tn+i because a new PTP will be planned 
and implemented after t„+i. Hence, the actual transmission 
time is min{T„, tn+i — tn}. It implies that when the (n-f 1)* 
event comes, the TP will use the newly designed PTP for 
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event n + 1, even if the planned transmission for event n is 
unfinished. Therefore, the actual transmit power, denoted by 
Pg is implemented piecewise by Pn,e\(t t +i] (restricting g 
to {tn, fra+i]) for each event n. Specihcally, the relationship 
between the actual transmit power and the FTP is 

Fe(f) ~Fn,e(f); f ^ (fn; fn+l]- (5) 

For a given triggering threshold e, all such pg : f i—> pg(f) 
in Q form the set of all possible transmit power fPg. Recall 
that pg is Lebesgue integrable over any subset of IR_|_ with 
hnite measure. Thus, ‘Pg is a set of non-negative Lebesgue 
integrable functions over any subset of ]R+ with hnite measure. 

To sum up, the event-trigger control framework is illustrated 
in Algorithm where Lines and are the very part to be 
designed in the rest of this paper. 

Algorithm 1 Event-Trigger Based Control 

1: Initial Condition: t = tg, t„ = to, E(t) = E(to), Q{t) = Q{to), 
E{t„) = E{to), and Q{tn) = Q(io). 

2: Assuming = 0 for t > the TP design the PTP Pn,e (see |^); 
3: while Q{t) >0 do 

4: The ED updates E{t) and Q{t) (the update frequency is dependent 

on the chip’s clock) and checks the condition in with s = t„', 

5: if Condition is satisfied then 

6: The ED triggers an event to activate the TP; 

7: The TP updates tn = t, E{tn) = E{t), and Q{tn) = Q(t); 

8: Assuming E[(i) = 0 for t > tn, the TP design the PTP Pn,e for 

this event; 

9; The transmitter uses the newly designed PTP as the transmit power, 

i.e., p(f) = Pn,e(t); 

10: else 

11: TP is inactive; 

12: The transmitter uses the most recent PTP as the transmit power, 

i.e., p(t) = p„,e(4); 

13: end if 

14: end while 


III. Problem Description 

In this paper, we study the transmission-time minimization 
problem under unknown energy-harvesting rate H. The trans¬ 
mission time is the time spent by the transmitter to clear up the 
data queue, and we label it as T{pe, E{tQ),Q{tQ), H), which 
is dependent on the transmit power design : t i—t Pe{t), the 
initial battery energy E{to), the initial data queue Q{tQ), and 
the energy-harvesting rate H. 

Since H is totally unknown, given pg, E{t[j), and Qito), the 
transmission time varies with different E[ G H. As a result, 
the transmission time is within the following range 

™{,T{Pe,E{to),Q{to),H) < TiPe,E{to),Q{to),H) 

ri 

< sup TiPe,E{to),Qito),H). (6) 
Hen 

Note that it is impossible to compute the average (or other 
statistical properties of) transmission time for any given design 
because the statistics of E[ are totally unknown. Nevertheless, 
it is possible to examine the worst-case transmission time for 
any given design, i.e., supj^g^ T(pe, E{to),Q{to), H). Thus, 
our approach is to hnd a design that achieves the minimum 
worst-case transmission time. In other words, our design aims 
to give the best performance under the worst-case scenario. 


Based on this idea, the transmission-time-minimization prob¬ 
lem is dehned in Subproblem 

Subproblem 1 (Transmission-Time-Minimization Problem). 
Given initial battery energy E(to), initial data queue Q{to), 
and triggering threshold e, design PTP p„ g in each event n, 
with the knowledge of E{tn) and Q{tn), such that 

T* = inf sup TiPe,Eito),Q{to),H), (7) 

PeGqJe Hen 

where the actual transmit power pg is determined by the PTP 
Pn,e tis shown in 

Remark 2. Indeed, the idea of defining Subproblem is 
borrowed from robust-optimal control HI8^ : In (|^, the sup 
operator returns the worst transmission time for a given pg 
under its corresponding worst-case energy-harvesting rate 
Hp^; while the inf operator reflects our aim of designing p* 
whose worst transmission time T {p*g,E(tf),Q{tf),Hp*^ is 
the smallest^One technical challenge in solving Subproblem^ 
is that the worst-case energy-harvesting rate depends on the 
choice of transmit power, i.e., for different pg, the worst-case 
Hp,, can be different. 

It is important to note that T* is not always hnite. In the 
case T* = oo, equation (0 can hardly measure the robust 
optimality on the designed pg, since for any pg the worst- 
case transmission time is always inhnite. Hence, SubprobIem[T] 
is not sufficient for describing all scenarios and we need 
a different problem formulation to deal with the case of 
T* = 00 as explained as follows: For a given pg, there 
should exist some energy-harvesting rate H resulting in a hnite 
transmission time, even though the worst-case Hp,, may lead 
to an inhnite transmission time. All possible such energy¬ 
harvesting rates form the hnite-transmission-time energy set 
TLf{pe,E{to),Q{to)) C H, dehned in Dehnition]^ 

Definition 2 (Finite-Transmission-Time Energy Set). Given 
Pg, E(to), and Q(to), the finite-transmission-time energy set 
'EfiPe,E{to),Q{to)) is 

{H:TiPe,E{to),Qito),H) <^, H G H} . (8) 

Considering two transmit power designs, denoted by Pg 
and Pg, whose worst-case transmission times are inhnite, 
we can say that Pg is more robust than p^ if the hnite- 
transmission-time energy set of Pg (i.e., 7f/(pg, E{to), Q(to))) 
is larger than that of Pg (i.e., ?{f(p^,E(to),Q(to))). This 
is because 'Hf{p^,E{tQ),Q{to)) is more likely to re¬ 
sult in a hnite transmission time in the actual transmis¬ 
sion. This motivates us to hnd the transmit power pg 
with the largest 'H/(pg, Q(fo)) such that any other 

T-Lf{p'^,E{to),Q{to)) is its subset, when T* = oo. 

Subproblem 2 (Energy-Set-Maximization Problem). Given 
initial battery energy Eff), initial data queue Q{tf), and 
triggering threshold e, if T* = oo, design PTP pn,e in each 

'‘The subscript pg in Hp,, highlights the fact that the worst-case H depends 
on the given transmit power. Although, mathematically Hp^ (for any pg) 
might not exist in P, since the operator in ^ is sup rather than max, the 
existence of the optimal transmit power design p* is given in Theorem [T| 
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event n, with the knowledge of E(tn) and Q{tn), such that 
Hfip',,Eito),Q{to)) C Hfip„E{to),Q{to)), W, G 

(9) 

As a summary of Subproblem [T] and Subproblem the 
robust-optimal transmit power should: achieve the minimum 
transmission time T* under the worst-case energy-harvesting 
rate, ifT* is finite; otherwise, ensure the largest set of H that 
results in a finite transmission time, if T* is infinite. Putting 
these two subproblems altogether, we define the Robust- 
Transmission-Time (RTT) problem as follows. 

Problem 1 (Robust-Transmission-Time Problem). Given ini¬ 
tial battery energy E{to), initial data queue Q{to), and 
triggering threshold e, design FTP g in each event n, with 
the knowledge of E{tn) and Q{tn), such that 

{ Pe satisfies 0, if T* < 00, 

Pe satisfies 0, if T* = oo, 

where the relationship between transmit power p^ and the FTP 
Pn,e f'S given in 0. 

In the rest of this paper, we focus on how to solve the RTT 
problem with the event-trigger based control. 


IV. Reachable Set Analysis 


Since our proposed control method is event-trigger based 
(see Section II-B| l, i.e., the transmit power is piecewise imple¬ 
mented in each event, in this section, we analyze all reachable 
battery energy and data queue (the system states) in each event 
n. We stress that the optimal solution of the RTT problem 
is highly dependent on the structure of the reachable set of 
battery energy and data queue. 

Recall that during event n the TP only takes into account the 
information of battery energy and data queue at and ignores 
the energy-harvesting rate H(t) for t '> tji- The battery energy 
and data queue seen by the TP behave as 


En{t) = E{tn) - Pn,E('r)dr, 
Quit) = Q{tn) - r {pn,e(T)) dr, 


( 11 ) 


where En{t) and Qn{t) refer to the dynamics of the battery 
energy and data queue known by the TP based on its available 
information (i.e., E(tn) and Q{tn) but not H{t) for t > tn), 
which are distinct from the actual battery energy E{t) and 
data queue (3(f)|^ Since in each event the PIP is designed by 
the TP, we should analyze the property of En{t) and Qn{t) 
rather than E{f) and Q{f). This is because the robust-optimal 
control of the transmit power can only be designed according 
to what the TP knows, i.e., En{t) and Qn{t)- 

Now, we define the reachable set of our interest. The 
reachable set contains all reachable states {En{t), Qn{t)) after 
implementing the PTP over the planned transmission time in 
the n* event, i.e., all reachable -b Tn), Qnifn + Tn)). 


_^From we know that the relationship between E„{t) and E(t) is 
En{t) G F{t) G En(t) + £ for t £ [tn, tn+i]. The relationship between 
Qn{t) and Q{t) is Q„{t) = Q{t) for t S [tn, tn+i]. 



Fig. 2. Illustrations of reachable set in the En — Qn (two-dimensional) 
region for event n: point a denotes (E{tn),Q{tn)), and b, c as well as d 
are different end points (^En, Qn) in the reachable set IH„ (blue region). The 
an'ows with numbers 1 and 2 are two different paths from a to b. The arrows 
with numbers 3 and 4 are the paths from a to c and d, respectively. 


Note that the planned transmission time Tn is generally 
different for different PTPs. 

Fig. 1^ gives a pictorial illustration ofjhe readable set: At 
t = tn, the system state is at point {En{tn),Qn{tn)) (i-e., 
point a). After tn, the PTP pushes the system state (seen by 
the TP) to move along the aiTow (different PTPs correspond to 
different arrows). Alt = tn-\-Tn, the state stops at a point (e.g., 
point b) which corresponds to -b Tn),Qn{tn + Tn)). 

All [En{tn + Tn),Qn{tn + Tn)) compose the reachable set 
(i.e., the shaded area). We define the reachable set as follows. 

Definition 3 (Reachable Set). From given E(tn) and Q{tn), 
the reachable set in event is 

=1 (Jlln, Qn'^ ■ En = E{tn) - j Pn,e{T)dT > 0, 

_ rtn-\-T'rL 

Qn=Q{tn)- J r{pn^g{T))dT > 0, 

Pn,€ G ^n,€, Tn < OO j", 

where we use {En,Qn) rather than {En{t),Qn{t)) to repre¬ 
sent the point in as [En{tn + Tn), Qn{tn + Tn)) can be 
the same with different Tn, which violates the definition of set. 

In Fig. 1^ we can see that different PTPs correspond to 
different paths or arrows in the figure, which may or may not 
arrive at the same end point. 

Even though Definition gives an expression of the reach¬ 
able set, it is too abstract and not convenient for design. To 
give a more explicit form of reachable set, we define the Rate- 
Power Equilibrium (RPE) to help the subsequent analysis. 

Definition 4 (Rate-Power Equilibrium (RPE)). The rate-power 
line is defined in the r — p plane (see Fig. 0.' 

r = KnP, where Kn = -2^. (12) 

E{tn) - En 

The intersection of rate-power line and rate function r = 
log 2 (l +p) for p G (0, Pmax] (here, p is a scalar) is called 
the RPE, and the corresponding transmit power of the RPE 
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Fig. 3. Rate-power line, rate function and RPE, where r{p) = log 2 (l -I- p) 
and K!^ > ffmax > K'^ > Xmin > K”'. For K'^ and Ffmax, no positive 
intersections exist due to the large Kn- For X", ii’min and K!^', positive 
intersections exist because of the small K„: Points a as well as b are RPEs, 
but point c is not a RPE due to p® > pmax- We can see that p® decreases 
as K„ going large. 


is labeled by . For En = E(tn) or Qn = Q{tn), we define 
their p® = 0, even though no RPE exists. 

J^rom Definition we see that any arbitrary pair of values 
{En,Qn) has a corresponding rate-power line in the r — p 
plane. Because of the concavity of the rate function, there 
exists at most one RPE for an arbitrary Qn) - If the RPE 
exists, we can use the following remark to calculate it. 

Remark 3. Solving KnPn = log 2 (l +Pra)> we have 

= (13) 

where W-i is the real valued Lambert W function ^28^ in 
the lower branch (W < — Ij. Flere the RPE is the point 
(Pn^ log 2 (l +pI)) in the r-p plane. 


Proof: See Appendix [B| _ _ * 

Proposition 1 means that the point (En,Qn) is in the 
reachable set if and only if the corresponding Kn (i.e., the 
slope of the corresponding rate-power line) is within a certain 
range. As shown in Pig.|^ the slope Kn decreases as p® grows, 
which implies; On the one hand, A'max is the supremum of the 
slope Kn to have an intersection between the rate-power line 
and the rate function (i.e., to have a RPE). On the other hand, 
due to the maximum power constraint, ATmin is the minimum 
slope to have a RPE. 

A pictorial illustration of the reachable set is shown in 
Fig. 0 and it can be easily categorized into three cases, 
depending on the relationship among ATmin, Ffmax, and Kn^^ 
(called the energy-balanced slope), where 


K'^^^ := 


Q{tn) - 0 Q{tn) 


(17) 


E(tn)-Q Eifn)' 

To be more specific. Fig. |4(a)| Fig. |4(b)| and Fig. 4(c) 
correspond to < ATmin, Ffmin < < Ffmax and 

> ATiiiax, respectively. These three cases have important 
physical meanings, given in the following remark. 


Remark 4 (Categorization of R^aclwble Sets). In Fig. 4(a)\ 
En is always greater than 0, V(A„,Qra) € which implies 
that the battery energy is abundant. This means that when the 
data queue is cleared, there is still battery energy remaining, 
no matter what PTP is used. In this case, we say that is 


energy-abundant. In Fig. 4(b) origin o is in the reachable set, 
which means the data queue can be cleared by using all the 
energy stored in the battery, and in this case, we say that 
is energy-balanced. In Fig. \4(c)\ Qn is always greater than 
0, y{EmQn) G which means the data queue cannot be 
cleared with the available battery energy, no matter what PTP 
is employed. In this case, we say that 9l„ is energy-scarce. 


The following lemma makes the link between the reachable 
set and the RPE, which helps us to find an explicit expression 
of reachable set in order to facilitate transmit power design. 

Lemma 1 (Criterion on Points in Reachable Set). 

(a„, Qn) G Q(fn))} (14) 

if and only if the RPE for [En,Qn) exists. 

Proof: See Appendix [A| ■ 

Lemma tells that except for {E(tn),Q{tn)), any point 
in reachable set has a RPE, and any point which has a RPE 
must be in the reachable set. Based on Lemma [T] an explicit 
expression of reachable set can be given. 


Proposition 1 (Expression for Reachable Set). Given 
{E(tn),Q(tn)), the reachable set satisfies 


SH„\{(A(f„), Q(tn))} = I (^En, Qn) ■ Kmin <Kn< Kmax, 
0<En<E{tn),0<Qn<Q{tn)} , (15) 

where Kn is a function of En and Qn given in 0. and 

(16) 


_ r(p„iax) „ ,. r(x) 

Ft min ■— j Ft max ■— lim 

a:->- 0 + X 


1 

In 2 


The RPE not only helps to shape the reachable set (see 
Proposition [^1, but also gives the time-optimal PTP. Recall 
that from one starting point (A(f„), Q{tn)), there are multiple 
PTPs that reach the same end point (^En,Qn) (see Fig. 2i. 
These PTPs, however, spend different amount of planned 
transmission time Tn. Hence, we need to hnd the time-optimal 


PTP that has the minimum Tn for each point yEn,Qnj G fHn. 

For a given starting point (A(f„), Q{tn)) and an end point 
{En,Qn), the planned transmission time : '^n,e I^-i- 

is a non-negative functional of the PTP, and the time-optimal 
PTP has the smallest T„ (and we mark the minimum planned 
transmission time as T_n\En,Qn\ for end point {En,Qn)), 
i.e.. 


T.n[^n,Qn] — 


_inf^ 

S<^Vn,e[Er.,Qr 


Tn{Pn,e), (18) 


where ^n,e[EmQn\ stands for those PTPs to make the end 
point as [En,Qn)- The following proposition shows that the 
transmit power in the time-optimal PTP for a given pair of 
starting point and end point is unique and remains constant 
at the value of p® (the transmit power corresponding to the 
RPE) over the planned transmission time T„. 


Pmax 
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Fig. 4. Shapes of 3 cases of reachable sets (including the starting points a, d, h), where Li, L 2 , and L 3 correspond to lines Qn — Q{tn) = ^min {En—E{tn)), 
Qn - Q{tn) = Kmax{En — E{tn)), and Qn - Q(fri) = K^al{En - E{tn)), respectively, (a) Energy-abundant case < Kmin), point b is in the 

reachable set, whereas point c is not in the reachable set. (b) Energy-balanced case ^ < i^max)^ point e is in the reachable set, while points 

/ and g are not in the reachable set. (c) Energy-scarce case > F^max), point j is in the reachable set, but point i is not in the reachable set. 


Proposition 2 (Time-Optimal PTP). y{En,Qn) G 
^n\{{E{tn),Q{tn))}, the Unique time-optimal PTP to 
achieve Tj^[En,Qn\ E with parameters 

{PnAt)^T^) = {Pu,Tn[Eu,Qn]) , ( 19 ) 

where p® is the transmit power of the corresponding RPE 
which can be calculated by 03 in Remark^ and the minimum 
planned transmission time is: 


T.n [Em Qn\ — 


Qjtn) - Qn _ E{tn) - E^ 
r (Pn) Pu 


( 20 ) 


Proof: See Appendix [C] ■ 

With Proposition we can calculate the time-optimal 
PTP through (|T9] l to minimize the planned transmission 
time T_^\EmQ^ for any point in reachable set ex¬ 
cept for {E{tn),Q{tn))- But obviously, the optimal time 
for {E{tn),Q{tn)) is = 0. Note that 

different end points {En,Qn) correspond to different 
T^[E(tn)-,Q(tn)]. If the TP wants to clear the data queue 
with a minimum planned transmission time, it is equivalent to 
consider the end points with Qn = 0 and select one from them 
which has the minimum Tn[En,Qn]- This result is given in 
Corollary 

Corollary 1. If < ATmin, we have 


argmin r„[A;„,Q„] = (E{tn) - , (21) 

(B„,Q„)G3t„.Q„=0 2 

whose Pn (see Proposition § is p^ax- If iTmin < < 

ATmax, we have 


argmin [A„, Q„] = (0,0), (22) 

(-E„,Q„)G9t„,Q„=0 


whose Pn is labeled as which has the following form: 


where AT^*^' is given in (13 and W-i is th e real valued Lam¬ 
bert W function in the lower branch [281. If > ATmax. 
we cannot find any end point with Qn = 0. 


Proof: See Appendix [P] ■ 

Remark 5. We claim that Corollary plays an important 
role in the solution to the RTT problem, which is shown 
in Theorem More details can be found in the proof of 
Theorem (see Appendix 0- Briefly speaking, the time- 
optimal PTPs corresponding to p® in Corollary give the 
solution to the RTT problem for cases < ATmin cind 

K ■ < < K 


V. Solution to RTT Problem and Discussion on 
Triggering Condition 

The analysis on the reachable set of battery energy and 
data queue in an arbitrary event as well as the result on time- 
optimal PTP have enabled us to solve the RTT problem defined 
in Problem In this section, hrstly, we present the optimal 
solution of the RTT problem, and then discuss the effect of 
the triggering threshold e. 


A. Optimal Solution to RTT Problem 

Theorem 1 (Optimal Solution to RTT Problem). The optimal 
solution of RTT problem is p^^ with the parameterr\ 


{Pn,e{t),Tn) = 



Qjt.) 

Pmax, 

,bal 

7 ^bal J 


< K, 
ATmin < at, 
> K, 


min ? 
bal 


<K„ 


_ -*^^max5 


(24) 


where p^®'^ is given in ( |23| l. The corresponding actual transmit 
power implemented by p^ ^ is labeled as p^. 

Proof: See Appendix]^ ■ 


^Recall that any Pn,e in 0 can be determined by two parameters pn,e{'t) 
and Tn. 

















Recall the event-trigger based framework for transmission 
design summarized in Algorithm The solution to the design 
problem in Lines and is now given in Theorem [T] 


Remark 6. The structure of ^ is easy to understand. The 
first row corresponds to the energy-abundant case for (see 

Fig. 14(0)1 , and in this case, the maximum power is used to 
transmit, since there is enough energy. Likewise, the second 
row stands for the energy-balanced case (see Fig. 4(b)), and 
the corresponding transmit power is which can clear the 
data queue and use up the battery energy at the same time. 
In the third row, the energy-scarce case (see Fig. |4f'c)P , the 
transmitter sends nothing, which can be explained as that any 
transmission in this case would make things worse. 


Remark 7. The worst-case energy-harvesting rate for the 
optimal transmit power design exists and is given by 
F[pR : f i—> 0, i.e., no energy arrival in [fp, oo) (the proof 
is given in Lemma in Appendix H- We denote such no¬ 
energy-arrival case as Flo. It should be noted that the worst- 
case energy-harvesting rate of any given transmission power 
function is not always Hq. Indeed, to determine the worst-case 
energy-harvesting rate of a given transmission power design 
is difficult in general, which is the main difficulty in solving 
the RTT problem. In Section we will show an example 
of a transmit power design of which the worst-case energy¬ 
harvesting rate is very different from Hq. 


B. Discussions on the Triggering Threshold 


In Section V-A the optimal solution of the RTT problem 
is investigated for a given triggering threshold. A natural 
question is that: how does the triggering threshold e affect 
the system behavior (i.e., T* and 'Hf(p^,E(tf),Q(tf)))l In 
this subsection, we give the corresponding answers. 

Firstly, we show that when T* < oo holds, T* is inde¬ 
pendent of the value of e, which is easy to verify, since the 
worst-case H of the robust-optimal solution p. 


is Ho. In 


this worst case, V^i, £2 > 0, we have pf^ = p^^. This is 


because: For Ho, the next event would never be triggered. As 
a result, the actual transmit power is only dependent on the 
FTP designed in event 0 which is not affected by the value of 
the triggering threshold. 

Different from the T* < 00 case, for the T* = 00 case, 
'Hf{pf,E(to),Q{to)) is dependent on the value of e, and the 
following proposition tells that the smaller e is, the larger 

H.f(j)f,E(to),Q(to)) will be. 


Proposition 3. For T* = 00 , if is a multiple of 
£“ with multiplier z G \ {!}, i.e., z£°‘ = e^ then 
'Hf(j)fa,E(to),Q{to)) D Hf(pft,E{to),Q(to)). 

Proof: See Appendix]^ ■ 

Proposition [^implies that the smaller e is, the larger the set 
T-Lf(pf,E{tQ),Q{to)) will be, which means the more cases 
of energy-harvesting rate H can result in a hnite transmission 
time. Flowever, due to the limited computational resource, we 
cannot make e arbitrarily small because smaller e leads to 
more frequent event triggers. In practice, we should balance 
the computational accuracy and efficiency. 


VI. Simulation Results 

Now, we present the simulation results to illustrate the 
benefit of the proposed transmission designs based on robust- 
optimal control. 

Since there are no comparable online algorithms in the 
literature, we take the following three designs in the event- 
trigger control framework as examples to compare with our 
design. The first is an estimation-based algorithm, labeled as 
p®, which estimates the future energy-harvesting rate based 
on the energy-harvesting rate in the past. The corresponding 
PTP in each event n has parameters: 


{Pn,e{t),Tn) = 


Pmax, r{p„,„„) J 

hall . E{t„) \ 

-I- Ap, J 



^bal ^ ^ , 

K ■ < 

— -^'■max5 


<ifn 


(25) 


where Ap = 0 for n = 0, and Ap = [E{tn) — E{tn-i)]/(tn — 
tn-i) for n > 0. Compared with the robust-optimal PTP 
design in pjjg has the same structure and the only 

difference is the second line in ( |25| l, where A p is the 
additionally allocated power in every event. We can see that Ap 
is dependent on the average energy-harvesting rate in the last 
event, i.e., — E{tn-i)]/{tn — fn-i)- The additionally 

allocated power takes into account the future energy arrival 
whose rate is estimated to be the same as that in the last event. 
In contrast, the robust-optimal design does not assume any 
future energy arrival within the current event. Therefore, the 
estimation-based design is smarter than our robust-optimal de¬ 
sign p^ when H happens to be a stationary process, since the 
average energy-harvesting rate [E{tn) - E(tn-i)]/(tn-tn-i) 
in the past can be a reasonably accurate estimate of the energy¬ 
harvesting rate in the future. The second design is the modihed 
estimation-based algorithm, labeled as p^, which has the same 
structure ( |25] ) to p^ , but with a different Ap that: Ap = 0 for 
n = 0, and Ap = 0.2f,[E{tn) - E(tn-i)]/{tn - fn-i) for 
n > 0. The modified estimation-based algorithm p^ is more 
conservative than pf because only a quarter of the estimated 
energy is used. The third design is the greedy algorithm, 
labeled by p'f, which simply transmits data with maximum 
power in every event n if E{tn) > 0 and Q{tn) > 0. 

We present simulation results in three scenarios. In the hrst 
two scenarios, the energy-harvesting rates are deterministic but 
totally unknown to the transmitter, and in the third scenario, 
the energy-harvesting rate is a non-stationary stochastic pro¬ 
cess whose statistics are totally unknown to the transmitter. 

In the section, the units of all parameters are normalized, 
and we assume fg = 0- 

Scenario 1. We set the initial battery energy as E(ff) = 
1, the initial data queue length as Q(0) = 1, the maximum 
transmit power as Pmax = 3, and the triggering threshold as 
£ = 0.05. The energy-harvesting rate is chosen as H{f) = h 
for t G [0, 0.2), and H(t) = h/10 for t G [0.2, 00 ), where h 
is selected from [0, 2]. The transmission ti me co mparison of 
Po. 05 ’ Po. 05 ’ Pao 5 ’ p ®05 is given in Fig. |5^ from which 
we can observe that: 
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1) The worst transmission time of p^oa 1- The worst- 
case energy harvesting rate is H{t) with parameter h G 
[0, 0.15], hence including Ho as a worst case. 

2) The worst transmission time of pf ga *^his hgure is 3.2. 
The worst-case energy harvesting rate is with parameter 
h = 0.5. Clearly, Ho is not the worst case. 

3) The worst transmission time of p^oa *^his figure is 1.95. 
The worst-case energy harvesting rate is with parameter 
h — 0.5. Clearly, Ho is not the worst case. 

4) The worst transmission time of pfoa The worst- 

case energy harvesting rate is Ho- 

We can see that the transmission time is guaranteed by our 
design to be not greater than 1, while the other three designs 
can result in a transmission time much larger than 1. 



(a) 



(b) 

Fig. 5. Transmission time comparisons: (a) T* < oo; (b) T* = oo. 


Scenario 2. We set the initial battery energy as E{0) = 0.2, 
the initial data queue length as (5(0) = 1, and the maximum 
transmit power as Pmax = 3. The energy-harvesting rate is 
chosen as H{t) = a| sinfj, for t G [0, 1], and H(t) =0 for 
t G [1, oo), where a is selected from [0, 5]. The transmission 
time comparison of p^g;^ (e = 0.01), Pg.oi = 0.01), p^qi 
(e = 0.01), p^gi (£ = 0.01), and p ^2 = 0-2) is given in 

Fig. 5(b) We can see that T* = T(p^, E{0),Q{0), Ho) = oo 
(for all e). When e = 0.01, for p^gi, the region of h 
having hnite transmission time is a G [1.1, 5], but for pf g^, 
p^g]^, or p[fgi, this region is much smaller (a G [2.7, 5], 
a G [2.6, 5] and a G [2.9, 5], respectively). Hence, if the 


actual energy-harvesting rate is with a — 2.5, the robust- 
optimal design p^j'^gi results in a transmission time of 1.33, 
while pf gi, P^gi and p^gj return an inhnite transmission time. 
Additionally, since 0.2 is a multiple of 0.01, Proposition]^ tells 
that Hf{pf oi,E{0) Q{0 )) D Hfip^2^E{0),Q{0)), which is 
also verihed in Fig. |5(b)| 

Scenario 3. We set the initial battery energy as E{0) = 1, 
the initial data queue length as (5(0) = 1, and the maximum 
transmit power as Pmax = 3. The energy-harvesting rate is 
chosen as a modihed compound Poisson process H{t) = 
Di{t)]'^, where {N(t): f > 0} is a Poisson process 
with rate A = 2, and Di{t) is a Gaussian random variable with 
mean a|sinf| (a G [0, 5]) and variance 1. The transmission 
time comparisons of p^, pf, and p^ for e = 0.01,0.05 are 
given in Fig. The performance of the greedy algorithm is 
not includ ed as it is much worse than all other algorithms. In 
Fig. 6(a) the worst-case transmission times of Pg gi and p^gi 
are 24.6% and 22.1% larger than that of Pg^g^, respectively. 
Similarly, in Fig. 6(b)| the worst-case transmission time of 
Pao 5 smaller than those of Pg gg and p^gg. Comparing 
Fig. 6(a) with Fig. 6(b) we can see that a larger triggering 
threshold actually reduces the worst-case transmission time for 
the estimation based algorithms pf and p^. This is because 
H (t) is highly non-stationary and a larger triggering threshold 
is less sensitive to the rapid and non-stationary fluctuation 
in H{t) which results in a better worst-case performance 
for the estimation-based algorithms. For our robust-optimal 
algorithm p^, the worst-case transmission times are the same 
for £ = 0.01 and e = 0.05, which coincides with our 
conclusion in Section IV-BI 


VII. Conclusion 

We have solved the transmission-time minimization prob¬ 
lems for an energy harvesting transmitter, where the future 
energy-harvesting rate is totally unknown. Specihcally, our 
design is based on two advanced methods in cybernetics: 

• Event-trigger control: The Event Detector (ED) contin¬ 
uously monitors the battery energy and triggers a new 
event when it experiences some significant change from 
the last event. Whenever an event is trigger, the Trans¬ 
mission Planner (TP) uses the current information of the 
battery energy and the data queue to update the transmit 
power based on robust-optimal control. The event-trigger 
control framework is summarized in Algorithm 

• Robust-optimal control: It minimizes the worst-case 
transmission time such that the actual transmission time is 
guaranteed to be below this level, no matter what energy¬ 
harvesting rate is imposed. If the worst-case transmission 
time is always infinite for all possible transmit power 
design, then the robust-optimal control guarantees the 
largest set of energy-harvesting rate to have a finite actual 
transmission time. The robust-optimal transmit power 
design is given in Theorem 

Eor future work, one can adopt the approach used in this 
work to design transmission protocols for other objectives, 
such as throughput maximization, with no knowledge on the 
future behavior of energy-harvesting rate and data arrival 
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(a) 



(b) 


Fig. 6. Transmission time comparisons: (a) e = 0.01; (b) £ = 0.05. 


i) Ai C A 2 : 'i[En{t),Qn{t)) € Ai, the RPE exists 
according to Lemma Hence, Kn = Since r{p) 

is strictly concave for p and r(0) = 0, 


riPu) _ r{Pe) - f(0) 


(28) 


is strictly decreasing in (0, Pmax]- Therefore, Ky^in < Kn < 
-^niax. and {Eny Qn^^ 

ii) A 2 C Ap. y^EmQn) S A 2 , the RPE exists according 
to (|^. Thus, {En,Qn) G Ai. 

To sum up, Ai = A 2 and •GUI holds. 


Appendix C 

Proof of Proposition[3 

Since r{pn^e{t)) = log 2 (l + Pn,e{t)) is strictly increasing, 
concave and non-negative, ^Pn,e{t) G [0, Pmax], r{pn,e{t)) 
can be expressed by 

r{Pn,e{t)) = K'^Pn,eit) + ^ “ 4 Pn,e(.t)), (29) 

where 6 is a positive constant, and AT' is the derivative 
of r{pn,e{t)) at Pn,e{t) = p®, wWch implies AT' := 

linip^ dr(p„,e(f))/dp„,e(t). In e{pn,e{t)) > 0, 

and e{pn,e{t)) = 0 holds only when Pn,e{t) = Pn- 

Recall the proof of necessity in Lemma [T] that if ( |T^ holds, 
then 0 is satished. We break down into 3 parts by 
Lebesgue integral. 


f A{t)dt + j A{t)dt + j A{t)dt = 0, (30) 

JSi J S 2 JS 3 


process. Additionally, it is interesting to see how the robust- 
optimal solution performs using experimental measurements 
(e.g., p9l ) of the energy-harvesting rate. 

Appendix A 
Proof of Lemma[T] 

Necessity. If ([T^ holds, there exists a f > such that 
is not empty, and we have 

Qitn)-Qn 

-^ = --?-•• (2o) 

E{tn) - En Pn,e{T)dT 

According to ( [T^ , equation ( |26] l can be further rewritten as 

[KnPn,s{T) -r{pn,e{T))]dT = 0. (27) 

If no RPE exists, then either KnPn,e{T) > f{Pn,e{j)) or 

KnPn,e{j) < r{pn,e{'r)) holds for [tn, K + Tn], which 

contradicts with ( |27| i. Therefore, the RPE exists. 

Sufficiency. If the RPE exists, we set Pn,s{t) = Pn the 
transmit power. Note that KnPn — f{Pn)^ hence ( [l4| ) 
holds. 


Appendix B 

Proof of Proposition[T] 

Eor the simplicity of this proof, we label Ai = 

^n\{iE{tn)jQ{tn))} and A2J= {(A„,Q„) : Kynin < Kn < 

iGmax, 0<En< E{tn),0 < Qn < Q{tn)}. 


where A(f) = [r{pn,e{t)) - KnPn,e{t)], Si = {t : Pn,e{t) < 
S 2 ={t: Pn,e{t) > Pn) S 3 = {t : Pn,e{t) = Pn}- 
Let p be the average transmit power over [f„, + T„], and 

we have 

_(a) !si Pn,e{t)dt + fPn,e{t)<^t + S S 3 
^ p{Si) + p{S 2 ) + p{S3) 

(b) Isy ^ iPn,s{t)) dt + fs 2 r (Pn,e(t)) dt -f r {pl) dt 
Kn[p{Si)+pi{S2)+p{S3)] 


( 31 ) 


where 


. ^ /s,£(Pn.e(0)dt + /g^e(:Pn.e(f))dt 

Kn[p{Si)+p{S2)+p{S3)] - ■ ^ ^ 

In ( [3 T] i, (a) represents the average power, and (b) is from ( [30l l, 
and (c) is derived by employing ( [29l ). In ( |32] l, the equality 
holds only when p {Si) = 0 and p {S 2 ) = 0, i.e., @ hold^ 
Prom ([3 T]i, the explicit form for p is 


b — A'„A ^ b 
Kn-K'^ - Kn-K'^ 


= Pn 


(33) 


^Mathematically, formula (13 holds almost everywhere (a.e.) in the dura¬ 
tion [tri, fn + ^n] except some sub-durations with zero measures. Practically, 
since the transmit power cannot change that fast, we exclude the zero-measure 
cases and the term a.e. is omitted. 
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Therefore, 


Tn — fj- (5'i) + /i (S' 2 ) + /i {S 3 ) 


E{tn) — E„ 


^ E{tn) En _ ^ 


(34) 


P Pn 

where the equality holds if and only if ( |T^ holds according 
to ( [ 3 ^ . Thus, ( |T9 ] i is the time-optimal transmission power. The 
uniqueness for ( [T9l l is obvious, otherwise ( |34l i cannot hold. 
Note that = r(p®), and hence ([20li is obtained. 


Appendix D 

Proof of Corollary[T] 
Taking the partial derivative of T_^\En,Qn 
E„, we have 




dT^[En,Qr, 

dE„, 


d 

dE„ 

pI 


Ejtn) - E„ 
Pn 


dp^ 

dE^ 


E{tn) - Er, 


{p: 


,ei2 


Employing the chain rule, we have 

dpf, dpi dK„ Pi 




dE„ dKn dE„ 




log2(l +p^) - 0 




- 0 


Kl = lim 
P^Pf. 


\ 0 g 2 {l+pl)-p 


Pn-P 


dT^[En,Qn 

dE 


KL 


pl{K^-Kl) 


> 0 , 


Lemma 2. If the initial battery energy i?(fo) initial data 
queue Q{to) satisfies Q{to)/E{to) = < ATmax. then the 

following holds 

sup T{pf,E{to),Q{to),H) = Tipf,E{to),Q{to),Ho) < 00 . 

(39) 

Proof: Firstly, it can be easily obtained that 


w.r.t. 


(35) 


(41) 


(36) 


Kl - E{tr,) - E^ 

where dpl/dKn is derived by using the implicit differenti¬ 
ation on KnPl = log 2 (l + pI), and dKn/dEn is obtained 
by ([T2|. In ( ( 3 ^ , Kl is the derivative of log 2 (l + pI), i-e., 
Kl := l/[ln2(l +p®)]. Since and K'^ can be rewritten 
as 


(37) 

we know > K'^ due to p® > p > 0. By ( |3^ , equation ( |35l ) 
can be rewritten as 


(38) 


where (a) follows from > K'^. Thus, with fixed Q^, 
the planned transmission time T„ [A„, Q„] increases with En- 
This means that: If < ATmin, the minimum 
achieves at point b in Fig. |4(a)| and we can calculate the battery 
energy of point b as E{tn) — Q{tn)/K^in by the equation 
of Li (see the caption in Fig. |^, which implies ( |2T] i holds. 
Since in Fig. 4(a) is with slope ATmin, from Fig. we 
have pI = p^ax- Similarly, if AT^in < AT^’"' < AT^ax, we can 
derive ( |22l l and ( |2^ . ^ 

For A:^^' > ATinax, we cannot find any end point with Qn = 
0, according to Fig. |4(c)| 

Appendix E 
Proof of Theorem[T] 

Before starting, we give two lemmas: Lemma|^tells that the 
worst-case energy-harvesting rate of the designed PTP p^^ is 
Ho : t I—>■ 0 {t G [to, 00 )) if the reachable set IHo is not in 
energy-scarce case; Lemma indicates when T* is finite. 


sup T{p, ,E{to),Q{to),H) > T{p^,E{to),Q{to),Ho). 
Hen 

(40) 

Secondly, for energy-harvesting rate Ho, there are no triggered 
events, since no energy comes in [to, 00). For H Ho, 
there should be fc G Z+ triggered events. If fc = 0, then the 
transmission time will be the same as that for Ho- If fc > 0, 
then in the first triggered event, then E{ti) is 

f E{to) - {ti - fo)Pmax + e, < ATmin, 

\ E{to) - ih - + e, K^in < < K, 

If such calculated E{ti) < 0, then the transmission time is 
still the same as that for Ho- If E{ti) > 0, then ATj”^' is 

{ Q(to)-(*l-*o)r(Pniax) ^ ^bal T^bal ^ tp' 
B(io)-(ti-to)Pmax-Pe - ^0 > ^0 ^ 

Q(to) —(tl—to)»’(pQ“'') ^ T^bal 7^ ^ T^bal ^ 7^ 

£;(io)-(ti_to)pbai+5 ^ 0 > -^min ^ ^ ^max- 

(42) 

According to Fig. that p® decreases w.r.t. AT^, we have 
Therefore, the transmission time becomes smaller. 
To sum up, ViT G H the following inequality holds 

T{pf,E{to),Q{to),H) < T{pf,Eito),Q{to),Ho), (43) 

which means 

sup T{pf,E{to),Q{to),H) <T{pf,E{to),Q{to),Ho). 
hgu 

(44) 

Now, combining inequality ( |44| ) with inequality and 

noticing that T{p^,E{to),Q{to),Ho) < oo (since the cor¬ 
responding reachable set for n = 0 is energy-abundant or 
energy-balanced), we can finally derive 


Lemma 3. T* < oo if and only if Q{to)/E{to) = ATg®'^ < 
K 

J'-max* 

Proof: Necessity. By contrapositive, we should prove that: 
if ATg*^^ > ATmax, then T* = oo. Under Ho '■ t i-G 0 {t G 
[fo, oo)), we have T(pe, E {to), Q {to), Ho) = oo for any p^ G 
'Pe, which implies 


T* = inf sup T(pe, A(fg),Q(fg),iT) 
PeSTe H^H 

> inf T{pe,E{to),Q{to),Ho) = oo. 

Pe&Ve 


(45) 


Therefore, T* = oo. 


Sufficiency. We prove that if ATg®^' < ATmax, then T* < oo. 


Firstly, we have 


T* = inf sup T{pe,E{to), Q{to), H) 
Pe6Te Hen 

< sup T{pf, E {to), Q {to), H), 

Hen 


( 46 ) 
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which combined with < ifmax and Lemma implies 

that 

T* < sup T{pf,E{to),Q{to),H) 

H^n 

< Tipf,E{to),Q{to),Ho) < oo. 

Now, we start the proof of Theorem [T] 

We divide this proof into two parts corresponding to T* < 
oo (see Subproblem [T]) and T* = oo (see Subproblem |^, 
respectively. 

i) For T* < oo, from Lemma we know < iTmax 
holds. 

Let the optimal solution be p* ^ (implementing p*) and the 
corresponding worst-case energy-harvesting rate be i.e., 
T* = Tipt,E{to),Q{to),Hp.). We have 

T{p*„E{to),Q{to),Hp,J=T* 

< sup Tipf,E{to),Q{to),H) = T{pf,E{to),Qito),Ho), 
H^n 

(48) 

where (a) follows from Lemma since for < iTmax, 

the worst-case energy-harvesting rate of pf is Ho. 

On the other hand, we can derive 

T{pt, E{to),Q{to), Hp.) > T{pl,E{to),Q[to),Ho) 

ib) (49) 

> T{pf,E{to),Qito),Ho), 

where (&) follows from the fact that is time optimal under 
Ho- This is because, in this case, event 1 is never triggered, and 
the FTP is always equal to the actual transmit power [see Q], 
which means that: for < iTmax, the actual transmit power 
is optimal according to Corollary[T] Therefore, combining ( |48l ) 
and ( |49l l, we have p* = p^, i.e., Pne= Pn.e- 

ii) For T* = oo, from Lemma [s] we know > Ffmax 
holds. 

We will show pf- satishes © in Subproblem 1^ by 



amount of harvested energy from to to tg + E that can be 
seen from the TP side. 

If we assume the amount of battery energy E(to) + Ne 
is available at to, then p^ is still be a valid transmit power 
design to clear the data queue Q{to), and the corresponding 
transmission time is also the same, i.e., T = T{pe,E{to) + 
Ne, Q(fo)7 ^o)j^ With Corollary[^ we know that p^ is time- 
optimal, i.e., T{pf,E{to) + Ne,Q{to),Ho) < TiPe,E{to) + 
Ne,Q{to),Ho) < oo, which implies 


Qjtp) 

Eitp) ~\~ Ne 


<K, 


_ -‘'■max- 


(53) 


Then, we prove T{pf,E{to),Q{to),H) is hnite. In event 

0 : 

1) If = Q(to)/E(to) < Ffmaxj then the transmission 
time is hnite, because even if no energy arrives in 
[fo, oo), the transmitter can still clean up the data queue 
by 

2 ) If = Q{to)/E{to) > iTmax, then from the 
transmit power is 0 during [tg, L). 

If case 2) happens, then in event 1, we have = 

Q{to)/{E{to) + e). Similar to event 0, if < Ffmax, then 
the transmission time is hnite. Otherwise, the transmit power 
is 0 during [ti, t2), which makes = Q{to)/{E{to) + ‘2e). 
Proceeding forward, if case 1) happens, then the transmission 
time is hnite, and if case 2) happens, the transmitter transmit 
nothing. Here, we can always hnd an event n < N such that 
= Q{io)l{E{to) + ne), if > iTmax- This is guar¬ 
anteed by ( |5^ . Now, we have T{pf, E{to),Q{to), H) < oo, 
and thus H e 'Hf{pf,E{to),Q(to)). 

To sum up, for any p^ e ViF G Hf{pe,E{to),Q{to)), 
we have H G 'Hf{pf^,E{to),Q{to)), which implies that 
'Efipe,E{to),Q{to)) Q nf{pf,E{to),Qito)) holds for all 
Pe e 


Appendix F 

Proof of Proposition!!] 


nfip„Eito),Q{to))cnf{pf,Eito),Q{to)), ype&^e, 

(50) 


or equivalently, for all p^ G ViT G 'Hf(pe,E{to),Q[to)), 
H should also be in the set 'Hf{p^,E{to),Q{to)). 

Now, Vpe G ‘Pe, and ViT G 'Hf[pe,E{to),Q{to)), we have 

/ log 2 (l+P£(r))dT = Q(fo), (51) 

Jto 

where T = T{pe, E{to), Q{to), H) < oo. Since function H is 
Lebesgue integrable over any subset of IR+ with hnite measure, 
the integral H{t) dr is hnite, which means the total 

number of triggered events is also hnite. Assuming event N 
is the last triggered event, we have 


/ Pe{T)dT = E{to) + Ne, 

Jto 


(52) 


in which the value e is the harvested energy in each event, 
since the integral of H{t) is continuous and the event is 
triggered when ^H{T)dT = e. Thus, Ne is the total 


ViT G 'Hf{p^t, E{to),Q{to)), the transmission time is = 
T{pfi,,E(to),Q{to),H) < oo. Then, similar to part ii) in the 
proof of Theorem we label the last triggered event number 
as N^, and by @7 the following holds 

j-to+E 

N^e^ < / H{T)dT < {N^ + 1)£^ (54) 

Jto 

Since is hnite, similar to ( |53] l, Q{to)/{E{to) + N^e^) < 
Ffmax holds. Noting that ze“ = e^, we have N'^e'^ = N°‘e°‘, 
where N°‘ = zN^. Thus, the following holds 


Qjto) 

E{to) + Af“£“ 


Qjtp) 

E{to) + N^eb 


<K„ 


(55) 


which implies that for £“, the transmission time 
T{pfa,E{to),Q{to),H) is hnite (similar to part ii) in the 


* After moving the future energy to to. the transmission time should be 
'T(P£, E{to) + Q(to), i^ieft) where i^ieft is an equivalent energy¬ 
harvesting rate but not necessarily Hq- However, since .f/ieft cannot trigger 
any event, we have T(p£,-E(io) + A/'e, Q(£o),-H'left) = T(p£,-E(£o) + 
Ne,Q{tQ),Ho) 
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proof of Theorem[^. Therefore, H S 'Hf{pfa,E{to),Q{to)), 
and nf{pfa,E{to),Q{to)) D nfipf,,E{to),Q{to)). 

On the other hand, we can find a iJ € 
Ufipfa,E{to),Q{to)) such that 

pOO 

(z — l)e“ = / E[{t)At < ze°‘= , (56) 

Jto 

which implies is infinite, since for e^, the first event never 
comes, and > iTmax will stay unchanged for [fo,c»). 

Hence, Hfipfa,E{to),Q{to)) 2 'Hf{pfb,E{to),Q{to)), and 
therefore, combining with the result in the last paragraph, we 
have Hf{pfa,E{to),Q{to)) D 'Hf{pft,E{to),Q{to)). 
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